EN FR
EN FR


Section: Partnerships and Cooperations

European Initiatives

FP7 & H2020 Projects

CompLACS
  • Type: FP7

  • Defi: Cognitive Systems, Interaction, Robotics

  • Instrument: Specific Targeted Research Project

  • Objectif: Cognitive Systems and Robotics

  • Duration: March 2011 - February 2015

  • Coordinator: John Shaw-Taylor

  • Partner: University College London, University of Bristol, Royal Holloway, University of London, Radboud Universiteit Nijmegen, Technische Universitat Berlin, Montanuniversitat Leoben, Institut National de Recherche en Informatique et en Automatique, Technische Universität Darmstadt

  • Inria contact: Rémi MUNOS

  • Abstract: One of the aspirations of machine learning is to develop intelligent systems that can address a wide variety of control problems of many different types. However, although the community has developed successful technologies for many individual problems, these technologies have not previously been integrated into a unified framework. As a result, the technology used to specify, solve and analyse one control problem typically cannot be reused on a different problem. The community has fragmented into a diverse set of specialists with particular solutions to particular problems. The purpose of this project is to develop a unified toolkit for intelligent control in many different problem areas. This toolkit will incorporate many of the most successful approaches to a variety of important control problems within a single framework, including bandit problems, Markov Decision Processes (MDPs), Partially Observable MDPs (POMDPs), continuous stochastic control, and multi-agent systems. In addition, the toolkit will provide methods for the automatic construction of representations and capabilities, which can then be applied to any of these problem types. Finally, the toolkit will provide a generic interface to specifying problems and analysing performance, by mapping intuitive, human-understandable goals into machine-understandable objectives, and by mapping algorithm performance and regret back into human-understandable terms.